Abstract


Digital transmission, storage and reproduction of audio signals as effectively as possible
needs adaptation to the perceptual characteristics of the human auditory system. This is
best accomplished in the frequency domain. A Transform Coder based on the Discrete
Fourier Transform is implemented. Perceptual parameters relevant to low bit rate coding
are computed for a segment of sampled audio signal.
Chapter 1


Introduction

1.1 Need Of Low Bit Rate Coders

Digital sound processing and storage adopted in audio are providing excellent sound
quality. However, converting an audio signal to a 16 bit digital format with appropriate
redundancy for error correction and with a minimum sampling rate requires extremely extended
bandwidth for signal transmission and storage, the latter coupled with huge mass storage
necessities. The large bandwidth results in problems for radio transmission in particular,
so there is considerable interest in avoiding any redundancy in the signal other than for
error correction purposes. To achieve sound transmission or reproduction that is not only
very good but also efficient, all equipment has to be adapted to the characteristics of the
final receiver, in this case the human auditory system. Any part of the transmitted signal
that is not recognized by the auditory system shows bad matching to the receiver and
provides unnecessary redundancy. Therefore digital audio coders are designed to minimize the
redundancy and irrelevancy in a signal for the purpose of bit rate reduction. That is, digital
waveform coders [4] seek to be in a region where digitized signals are perceptually relevant
and statistically nonredundant.

Low bit rate coders, if available with the requisite performance, could be used in a
number of places such as for remote broadcast lines, studio links, satellite transmission
of high quality audio and the like. For instance, a channel with 15 KHz bandwidth that
is perceptually transparent would be very useful to broadcast media, who could use it to
provide a remote link free of the noise and frequency response problems normally associated
with leased and/or equalized lines.

Focus on wider bandwidth signals of 15 KHz or 20 KHz has centered primarily on bit
rate reduction techniques. The perceptual Transform Coder [3] described in this thesis uses
a large encoding delay and has considerably greater complexity, but provides transparent
coding of a wide class of 15 KHz bandwidth audio signals at 4 bits per sample.

1.2 Overview Of The Thesis

In this thesis we have attempted to gain an understanding of the Perceptual Transform Coding
concept by implementing a DFT based Transform Coder and computing some of the
relevant perceptual parameters of a segment of sampled audio data. The coding algorithm
is based on Johnston's paper [3].

In the next chapter we review the ISO/MPEG Audio standard, and Chapter 3 reviews
the basic psychoacoustic property exploited in perceptual audio coding. Chapter 4
describes the implementation of the DFT based Transform Coder. Chapter 5 discusses
applications in Digital Audio Broadcasting (DAB).
Chapter 2


ISO/MPEG Audio Standard

2.1 MPEG Audio Compression Standard

Digitally coded audio requiring low bit rates has led the ISO/IEC (International Organization
for Standardization/International Electrotechnical Commission) standardization body to
establish MPEG (Moving Pictures Experts Group) standards for audio and video coding.
The MPEG Audio standard specifies the coded representation of high quality audio for storage
media and the method for decoding [8]. The compression techniques use psychoacoustic
models for predicting the human auditory response to the noise that is introduced in the
coding scheme. Using these models, the characteristics of the compression scheme can be
changed dynamically in order to minimize the audibility of these noise impairments.

The MPEG Audio standard is intended for application to digital storage media such as
Compact Disc (CD), Digital Audio Tape (DAT) and magnetic hard disc. The storage media
may either be connected directly to the decoder or via other means such as communication
lines. The MPEG Audio standard is intended for sampling rates of 32 KHz, 44.1 KHz and 48
KHz at bit rates in a range of 32-192 Kbits per sec per mono and 64-384 Kbits per sec
per stereo audio channel.

Depending on the application, three different layers of the coding system with increasing
coder complexity and performance can be used.

Layer I: This layer contains the basic mapping of the digital audio input into 32 subbands,
fixed segmentation to format the data into blocks, a psychoacoustic model to
determine the adaptive bit allocation, and quantization using block companding and
formatting. This layer is most appropriate for consumer applications such as digital
home recording on tapes, that is, for those applications for which very low data rates
are not mandatory.

Layer II: This layer introduces further compression with respect to Layer I by redundancy
and irrelevance removal on scale factors and uses more precise quantization. This
layer has numerous applications in both consumer and professional audio, such as
Digital Audio Broadcasting (DAB), recording, telecommunication and multimedia.

Layer III: This layer introduces increased frequency resolution based on a hybrid filter
bank. It adds nonuniform quantization, adaptive segmentation and entropy coding
of the quantized values for better coding efficiency. This layer is most appropriate
for telecommunication applications, in particular with narrowband ISDN, and for
professional audio having high weights on very low bit rates.

2.2 Basic Structure Of ISO/MPEG Audio Encoder

The basic structure of a perceptual audio coding scheme [6] is shown in Fig. 2.1.
Fig. 2.1(a) illustrates the basic structure of the ISO/MPEG Audio Encoder.

Figure 2.1: (a) Audio Encoder (b) Audio Decoder


It consists of:

1. A time-frequency mapping (filter bank) is used to decompose the input signal into
subsampled spectral components. Depending on the filter bank used, these are called
subband values or frequency lines.

2. The output of this filter bank, or the output of a parallel transform, is used to
calculate an estimate of the actual (time dependent) noise masking threshold using
rules known from psychoacoustics.

3. The subband samples or frequency lines are quantized and coded with the aim of
keeping the noise which is introduced by quantizing below the masking threshold.

4. A frame packing is used to assemble the bit stream, which typically consists of the
quantized and coded mapped samples and some side information such as bit allocation
information.


This thesis focuses on high frequency resolution, which leads to only limited time resolution.
Such systems are usually called Perceptual Transform Coders.


2.3 Basic Structure Of ISO/MPEG Audio Decoder

Fig. 2.1(b) illustrates the basic structure of the ISO/MPEG Audio Decoder. Bit stream data
is fed into the decoder. The bit stream unpacking and decoding block does error detection
if error check is applied in the encoder. The bit stream is unpacked to recover the various
pieces of information. The reconstruction block reconstructs the quantized version of the
set of mapped samples. The frequency-time mapping transforms these mapped samples
back into uniform PCM.


Chapter 3 


Psychoacoustics And Perceptual 
Coding 

3.1 Auditory Masking

Most of the time our world presents us with a multitude of sounds simultaneously. It is
often difficult to hear one sound when a much louder sound is present. This process seems
intuitive, but on the psychoacoustic and cognitive levels it becomes very complex. The
term for this process is masking. Masking is defined as the amount by which the threshold
of audibility for one sound is raised by the presence of another (masking) sound [10]. The
basic idea in the psychoacoustic approach is to determine which part of the noise is masked
by the signal and which part is audible.

Masking experiments using steady state signals have led to the conclusion that our
auditory system performs a spectral analysis which can be modeled by a continuous filter
bank of bandpass filters having a bandwidth of 100 Hz for frequencies below 500 Hz and
a bandwidth of approximately one sixth of the center frequency [7] above 500 Hz. That is,
our auditory system analyses in parts that correspond to critical bands. A list of critical
band edge frequencies is given in Table 3.1.
Band No.   Lower Edge (Hz)   Center (Hz)   Upper Edge (Hz)
    1              0               50            100
    2            100              150            200
    3            200              250            300
    4            300              350            400
    5            400              450            510
    6            510              570            630
    7            630              700            770
    8            770              840            920
    9            920             1000           1080
   10           1080             1170           1270
   11           1270             1370           1480
   12           1480             1600           1720
   13           1720             1850           2000
   14           2000             2150           2320
   15           2320             2500           2700
   16           2700             2900           3150
   17           3150             3400           3700
   18           3700             4000           4400
   19           4400             4800           5300
   20           5300             5800           6400
   21           6400             7000           7700
   22           7700             8500           9500
   23           9500            10500          12000
   24          12000            13500          15500

Table 3.1: A List Of Critical Band Center And Edge Frequencies
From Table 3.1 it is obvious that one critical band is added to the next, so that
the upper limit of the lower critical band corresponds to the lower limit of the next higher
critical band, thus producing the scale of the critical band rate. The critical band rate
scale mentioned [1][5] follows a linear scale up to about 500 Hz and then a logarithmic
frequency scale above 500 Hz.
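
The lookup implied by Table 3.1 can be illustrated with a minimal Python sketch. The names `BAND_EDGES_HZ`, `critical_band` and `band_width` are chosen here for illustration only; treating the lower edge of each band as inclusive is an assumption, since the table does not say which band owns a shared edge.

```python
from bisect import bisect_right

# Critical band edges in Hz, copied from Table 3.1 (band 1 starts at 0 Hz).
BAND_EDGES_HZ = [0, 100, 200, 300, 400, 510, 630, 770, 920, 1080, 1270, 1480,
                 1720, 2000, 2320, 2700, 3150, 3700, 4400, 5300, 6400, 7700,
                 9500, 12000, 15500]

def critical_band(freq_hz):
    """Return the band number (1 to 24) containing freq_hz, or None if outside.

    Assumption: a frequency equal to a shared edge belongs to the higher band.
    """
    if freq_hz < 0 or freq_hz >= BAND_EDGES_HZ[-1]:
        return None
    return bisect_right(BAND_EDGES_HZ, freq_hz)

def band_width(band):
    """Width in Hz of a band, taken as upper edge minus lower edge."""
    return BAND_EDGES_HZ[band] - BAND_EDGES_HZ[band - 1]

print(critical_band(1000), band_width(9))
```

For the band centered at 1000 Hz the width comes out as 160 Hz, which is close to one sixth of the center frequency, while the bands below 500 Hz are about 100 Hz wide.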

Fig. 3.1(a) shows the uncoiled inner ear including the basilar membrane. It indicates
that the critical band rate scale is directly related to the place along the basilar membrane
where all the sensory cells are located in a very equidistant configuration. Thus the critical
band rate scale is closely related to our physiology too. Fig. 3.1(b) and Fig. 3.1(c) show
the dependence of critical band rate on frequency. The critical band concept
has led to the definition of the Bark scale as the psychoacoustic equivalent of the frequency
scale.


3.2 Psychoacoustic Models

The MPEG audio standard is able to maintain CD audio quality reproduction up to a
compression ratio of 5 to 1 or better. This bit rate corresponds to encoding each audio
sample with an average of 3 to 4 bits per sample. The MPEG coding scheme achieves this
compression by placing the quantization noise in those regions of the frequency spectrum
where the human ear is least sensitive.

The psychoacoustic model is used to calculate the perceptual threshold for each time
segment of the source material. This threshold, which estimates the maximum noise that
can be inaudibly inserted into the signal, is used in the coder to maximize the amount
of perceptually acceptable noise that can be inserted into a given signal segment. As
the amount of the quantization noise is directly related to the number of bits used by
the quantizer, the bit allocation algorithm assigns the available bits in a manner which
minimizes the audible distortion.




Chapter 4 


Implementation Of Perceptual 
Transform Coder 

4.1 Why Transform Coder?

The 'Frequency Domain' approach, which exploits linear dependencies between input
samples for efficient digitization, is called Transform Coding. In this thesis Transform Coding
is used to achieve the desired bit rate reduction, for it is more robust than DPCM with
regard to input statistics and channel errors. The price paid for these advantages is increased
encoding delay, which is proportional to the transform length N, and an encoding complexity
in the case with adaptive bit assignment.

Fig. 4.1(a) is a block diagram of transform coding. It is a waveform digitizing procedure
where a block of N input samples x[n] is linearly transformed into a set of N transform
coefficients θ[k]. The coefficients are then quantized for transmission, and a
reconstruction of x[n] is obtained at the receiver using an inverse transform operation on
the quantized coefficients.

Fig. 4.1(b) shows how the above coding scheme is implemented. The input block is
high resolution PCM samples. The output of the coder is the combination of the outputs
of N PCM coders that convey quantized uncorrelated coefficient information, typically
with a much lower total bit rate than what is present at the coder input.

Figure 4.1: (a) Transform Coder (transmitter, channel, receiver) (b) Implementation of Encoder

The efficiency of a transform coding system will depend on the type of linear transform
and the nature of bit allocation for quantizing transform coefficients. Most practical
systems will be based on the Discrete Fourier Transform (DFT) approach.
The DFT is defined by

    \theta[k] = \sum_{n=0}^{N-1} x[n] \exp\left(-j \frac{2\pi k n}{N}\right), \quad k = 0, 1, 2, \ldots, N-1        (4.1)

    x[n] = \frac{1}{N} \sum_{k=0}^{N-1} \theta[k] \exp\left(j \frac{2\pi k n}{N}\right), \quad n = 0, 1, 2, \ldots, N-1        (4.2)
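
The forward and inverse transforms can be checked numerically with a direct evaluation. The following is a minimal Python sketch, not the thesis implementation (which uses Matlab's FFT); the function names `dft` and `idft` and the eight test samples are illustrative assumptions.

```python
import cmath

def dft(x):
    """Direct evaluation of the forward DFT (about N*N multiply-adds)."""
    N = len(x)
    return [sum(x[n] * cmath.exp(-2j * cmath.pi * k * n / N) for n in range(N))
            for k in range(N)]

def idft(theta):
    """Direct evaluation of the inverse DFT, including the 1/N factor."""
    N = len(theta)
    return [sum(theta[k] * cmath.exp(2j * cmath.pi * k * n / N) for k in range(N)) / N
            for n in range(N)]

x = [1.0, 2.0, 0.0, -1.0, 0.5, 0.0, -2.0, 1.5]   # arbitrary test block
theta = dft(x)
x_rec = idft(theta)

# Without quantization the round trip should reproduce the input block.
max_err = max(abs(a - b) for a, b in zip(x, x_rec))
print(max_err)
```

The coefficient for k = 0 is simply the sum of the input samples, which is the dc term referred to later in the side information.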


DFT representations have two important advantages:

• DFT makes use of a sine-cosine basis space. Hence it provides a natural framework for
optimizations of transform coder design with inputs where the perceptual effects of
signal distortion are best understood in the frequency domain.

• DFT is advantageous with respect to computation also. A direct evaluation of an N
point DFT requires about N^2 complex multiply-add operations, but if N is a power of
2 the Fast Fourier Transform (FFT) method needs in the order of N log2 N operations.
With N = 1024 the FFT is about one hundred times faster than a direct computation.
In the case of real valued inputs, further reductions are possible by exploiting the
properties of the DFT. The FFT algorithm can also be used to evaluate the inverse
operation.
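
The factor of about one hundred follows directly from the two operation counts. A short Python check, using the order-of-magnitude counts N^2 and N log2 N quoted above (constant factors are ignored, which is an assumption of this sketch):

```python
import math

def direct_ops(n):
    """Approximate complex multiply-adds for a direct N point DFT."""
    return n * n

def fft_ops(n):
    """Order-of-magnitude operation count for a radix-2 FFT."""
    return n * math.log2(n)

ratio = direct_ops(1024) / fft_ops(1024)
print(ratio)
```

For N = 1024 this gives 1024/10, that is, a ratio of 102.4.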

4.2 Implementation Of Transform Coder

A block diagram of the Transform Coder, as implemented using the Matlab package on a
Pentium PC @ 133 MHz, is shown in Fig. 4.2. The source material is a 16 bit PCM monophonic
audio signal of 15 KHz bandwidth sampled at 32 KHz. The window that is used in
the overlap sections is the square root of a Hanning window of length equal to twice the
overlap, that is 256 data points, since the windowing process for overlap-add and analysis
provides for an overlap-add of 1/16th. This overlap-add window is shown in Fig. 4.3 for three
successive frames, where each frame contains 2048 data points. By allowing this overlap
between the successive frames, reduction of end effect noise is achieved without
significantly lowering the number of bits available for encoding each frame. With the overlap of
1/16th, the number of new data points in each frame is 1920.
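
The reason for taking the square root of the Hanning window can be seen in a small Python sketch. Since the same window is applied once at analysis and once at synthesis, the effective weight on each sample is the window squared, and the squared rising and falling edges of adjacent frames should add up to one. The half-sample offset in `sqrt_hann` is an assumption made here so that the sum is exactly one; the appendix listing uses a slightly different sampling of the cosine.

```python
import math

FRAME = 2048
OVERLAP = 128                      # 1/16 of the frame length

def sqrt_hann(length):
    """Square root of a Hann window (half-sample offset is an assumption)."""
    return [math.sqrt(0.5 * (1.0 - math.cos(2.0 * math.pi * (k + 0.5) / length)))
            for k in range(length)]

def overlap_window(frame=FRAME, overlap=OVERLAP):
    """Rising edge, flat centre section, falling edge."""
    h = sqrt_hann(2 * overlap)
    flat = [1.0] * (frame - 2 * overlap)
    return h[:overlap] + flat + h[overlap:]

w = overlap_window()
hop = FRAME - OVERLAP              # new data points per frame

# Tail of one frame overlaps the head of the next; squared weights should sum to 1.
worst = max(abs(w[hop + k] ** 2 + w[k] ** 2 - 1.0) for k in range(OVERLAP))
print(len(w), hop, worst)
```

The hop between frames is 2048 - 128 = 1920 samples, matching the number of new data points per frame stated above.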

Figure 4.2: Block Diagram Of Transform Coder

Figure 4.3: Overlap-add window (horizontal axis: sample number)

The FFT is implemented directly on the windowed data. The 2048 real input data are
converted to a spectrum of 1024 complex points, counting the dc and Nyquist frequency
as one complex point. The spectral peaks of each 8 complex points are calculated. Since
the particular transform is a real-complex transform of length 2048, there are 1024 unique
complex pairs and 128 spectral peak values. These spectral peak values are quantized
using 8 bits, where each step size in the quantizer represents 170/256 = 0.664 dB. The reason
for choosing the number 170 is as follows.

The size of the quantizer is determined by the possible dynamic range of the 2048 point
spectrum given a 16 bit PCM input source. In a transform, the total spectral dynamic
range can vary from that of a unit impulse, here set to quantizer level 1, to the spectral
energy of a full scale sinewave, roughly 167 dB higher, hence the total range of 170 dB.
Then the number of bits available for coding the samples is calculated.
It requires the calculation of the following:

• The total number of bits available in a block of data

• The number of side information bits required. The side information is used in both
the transmitter and receiver to calculate bit allocation.

Since each block contains 1920 new data samples at 8 bits per sample, the total number
of bits available comes out to be 15360.
The side information consists of three parts:


• A 16 bit PCM encoding of the dc term from the FFT


• The threshold is encoded in 8 bit PCM where each step size is 0.664 dB


• The 128 spectral peaks are encoded in 8 bit PCM using a step size of 0.664 dB

By taking into consideration the above details, the number of bits required for side
information is calculated to be 1040. This number is then subtracted from the total number of
available bits to get the number of bits available for coding the samples.
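
The bit budget can be restated as a few lines of Python arithmetic. The constants are the figures quoted in the text; `quantize_db` is a hypothetical helper showing how a level in dB would be rounded to the 0.664 dB grid.

```python
STEP_DB = 170 / 256          # quantizer step size in dB (170 dB range, 8 bits)
NEW_SAMPLES = 1920           # new data points per frame
BITS_PER_SAMPLE = 8
SIDE_INFO_BITS = 1040        # figure stated in the text

total_bits = NEW_SAMPLES * BITS_PER_SAMPLE
coding_bits = total_bits - SIDE_INFO_BITS

def quantize_db(level_db, step=STEP_DB):
    """Round a level in dB to the nearest multiple of the step size."""
    return step * round(level_db / step)

print(round(STEP_DB, 3), total_bits, coding_bits, quantize_db(40.0))
```

This reproduces the 15360 total bits per block and leaves 14320 bits for coding the samples.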

After calculating the actual number of bits available, which is 14320 bits, the number
of levels in each set of quantizers is found by an iterative procedure by calculating

    k_i = 2 \cdot \mathrm{nint}(P_i / Thr_j) + 1        (4.3)

where

k_i = number of levels in each set of quantizers

P_i = quantized peak level

Thr_j = quantized threshold value

nint = nearest integer function

j = Bark frequency, varying from 1 to 24

In this thesis the threshold is initially assumed to be a constant, and the number of levels in
each set of quantizers is then calculated. From the number of levels, the number of bits
is calculated and compared with the total number of bits available to encode a block of
data samples. If both numbers are not equal, the threshold is adjusted accordingly and k_i
is recomputed. This process is repeated until all available bits have been used.
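
The iteration can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the appendix code: a single constant threshold is used for all bands, it is only ever raised (by 0.5 percent per pass), each peak is assumed to govern 16 coefficient values, and the peak magnitudes are synthetic. The names `nint` and `allocate_bits` are hypothetical.

```python
import math

def nint(x):
    """Nearest-integer function for non-negative x (halves round up)."""
    return math.floor(x + 0.5)

def allocate_bits(peaks, budget, coeffs_per_peak=16, thr=1.0, grow=1.005):
    """Raise one constant threshold until the block fits the bit budget."""
    while True:
        levels = [2 * nint(p / thr) + 1 for p in peaks]     # Eqn 4.3
        bits = [nint(math.log2(k)) for k in levels]         # bits per quantizer
        total = coeffs_per_peak * sum(bits)
        if total <= budget:
            return thr, levels, bits, total
        thr *= grow                                         # coarser quantizers

# Hypothetical peak magnitudes for the 128 groups of a frame.
peaks = [10000.0 / (i + 1) for i in range(128)]
thr, levels, bits, total = allocate_bits(peaks, budget=14320)
print(thr, total)
```

Because every k_i is of the form 2n + 1, each quantizer has an odd number of levels and therefore a level at zero. Raising the threshold lowers every k_i, so the loop is guaranteed to terminate.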
The quantized samples along with the side information have to be transmitted to the
receiver, and reconstruction of the input signal x[n] is obtained at the receiver using an inverse
transform operation on the quantized samples, with the results windowed with the same window
used for analysis. Fig. 4.4 shows the reconstructed signal of the Transform Coder for 16 bit
PCM input and Fig. 4.5 shows the 8 bit PCM Coder output for the same input.




Figure 4.4: (a) 16 bit PCM input (b) Transform Coder output

Listening to the reconstructed audio output reveals that while the 8 bit PCM coded
segment is noticeably noisy, the 8 bits per sample transform coded segment cannot be
distinguished from the original.

Also, to compare their performance, the Signal-to-Noise Ratio (SNR) is computed by first
finding their respective quantization errors. The SNR computed in decibels for a block of input
audio signal is as follows:

SNR = 19.74 dB for the Transform Coder

SNR = 14.95 dB for the 8 bit PCM coder
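
The SNR computation can be illustrated with a minimal Python sketch. The test signal here is a synthetic sine wave, not the audio segment used in the thesis, so the printed values are not comparable with the figures above; `snr_db` and `requantize` are illustrative names, and the step size follows the 16 bit to b bit conversion used in the appendix.

```python
import math

def snr_db(original, coded):
    """Ratio of signal power to quantization error power, in decibels."""
    signal = sum(a * a for a in original)
    noise = sum((a - b) ** 2 for a, b in zip(original, coded))
    return 10.0 * math.log10(signal / noise)

def requantize(samples, bits, full_scale=32767):
    """Uniformly requantize 16 bit PCM samples to the given number of bits."""
    step = (2 * full_scale) / (2 ** bits)
    return [step * round(s / step) for s in samples]

# Synthetic 16 bit test signal (an assumption, not the thesis source material).
x = [round(20000 * math.sin(2 * math.pi * 0.01237 * n)) for n in range(2000)]
snr8 = snr_db(x, requantize(x, 8))
snr12 = snr_db(x, requantize(x, 12))
print(snr8, snr12)
```

As expected for plain PCM, each additional bit raises the SNR, whereas the transform coder improves on 8 bit PCM by distributing the same number of bits according to the spectral peaks.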







Figure 4.5: (a) 16 bit PCM input (b) 8 bit PCM Coder output


The threshold value used in Eqn. 4.3 can be calculated using perceptual noise criteria.
Several steps are involved in calculating the masking threshold, of which Critical Band
Analysis using the Bark spectrum and the Spectral Flatness Measure are discussed.

4.3 Critical Band Analysis

The block diagram of the Perceptual Transform Coder is shown in Fig. 4.6.

Figure 4.6: Block Diagram Of Perceptual Transform Coder

From the 1024 unique complex pairs computed earlier (by windowing the input signal and
processing by FFT), the power spectrum is calculated using

    P(\omega) = \mathrm{Re}^2(\omega) + \mathrm{Im}^2(\omega)        (4.4)

The power spectrum is shown in Fig. 4.7. The spectrum is then partitioned into critical
bands according [3] to Table 3.1, and the energy in each critical band is summed:

    B_i = \sum_{\omega = bl_i}^{bh_i} P(\omega)        (4.5)

where

bl_i = lower boundary of critical band i
bh_i = upper boundary of critical band i
B_i = energy in critical band i

Figure 4.7: Power Spectrum (horizontal axis: frequency in Hz)

Fig. 4.8 shows the power spectrum and critical band spectrum (Bark Spectrum) for a
block of input signal. As discussed in section 3.1, the critical band rate scale follows a linear
scale up to about 500 Hz and then a logarithmic frequency scale above 500 Hz.

Figure 4.8: Power and Bark Spectrum

4.4 Spectral Flatness Measure

In order to determine the noise-like or tone-like nature of the signal, the Spectral Flatness
Measure (SFM) is used. The SFM is defined as the ratio of the geometric mean (Gm) of
the power spectrum to the arithmetic mean (Am) of the power spectrum:

    SFM = \frac{Gm}{Am} = \frac{\left( \prod_{k=1}^{N} P_k \right)^{1/N}}{\frac{1}{N} \sum_{k=1}^{N} P_k}        (4.6)

This ratio, by definition, is not greater than unity. SFM in decibels is given by

    SFM_{dB} = 10 \log_{10} \frac{Gm}{Am}        (4.7)

SFM is then used to generate a coefficient of tonality α, to determine whether the signal
is entirely tone-like or noise-like. α is given by

    \alpha = \min\left( \frac{SFM_{dB}}{SFM_{dBmax}}, 1 \right)        (4.8)

With SFM_dBmax = -60 dB, the SFM obtained with three successive frames of a 16 bit PCM
monophonic audio signal of 15 KHz bandwidth sampled at 32 KHz is tabulated as
follows. Since the SFM cannot exceed unity, the values in dB are negative.

Frame No.   SFM (dB)    α       Tonal/nontonal
    1       -35.577     0.59    Tonal
    2       -23.560     0.39    Tonal
    3       -32.021     0.53    Tonal

Table 4.1: Tonality Measure
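
The SFM and tonality computation can be illustrated with a minimal Python sketch. The geometric mean is evaluated through logarithms, as in the appendix; the function names and the two test spectra are assumptions made for illustration, and all power values must be strictly positive for the logarithm to exist. The SFM values of Table 4.1 are taken as negative dB values.

```python
import math

SFM_DB_MAX = -60.0

def sfm_db(power):
    """Spectral flatness in dB: 10*log10(geometric mean / arithmetic mean)."""
    n = len(power)
    log_gm = sum(math.log10(p) for p in power) / n   # log10 of geometric mean
    am = sum(power) / n
    return 10.0 * (log_gm - math.log10(am))

def tonality(sfm_value_db, sfm_db_max=SFM_DB_MAX):
    """Tonality coefficient: 0 for a flat (noise-like) spectrum, 1 for tonal."""
    return min(sfm_value_db / sfm_db_max, 1.0)

flat = [2.0] * 1024                      # noise-like: SFM is 0 dB
peaky = [1e-12] * 1023 + [1.0]           # tone-like: a single dominant line

# Tonality for the three SFM values of Table 4.1.
alphas = [round(tonality(v), 2) for v in (-35.577, -23.560, -32.021)]
print(sfm_db(flat), tonality(sfm_db(peaky)), alphas)
```

Dividing the tabulated SFM values by -60 dB reproduces the α column of Table 4.1.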


The complete code that is written using the Matlab package on a Pentium PC @ 133 MHz
for implementation of the Transform Coder and for computing the Spectral Flatness
Measure and Bark Spectrum is given in the Appendix.
Chapter 5


Future Scope Of The Thesis

5.1 Scope Of The Thesis

Work on Transform Coders using perceptual noise criteria for efficient coding of stereophonic
audio signals is underway. More recent work suggests that the Perceptual Transform Coder
could be used at a lower bit rate of 3 bits per sample for transparent coding of signal
sources. Examination of some 20 KHz bandwidth signals sampled at 44.1 KHz suggests
that the same or slightly fewer bits per sample should provide transparent coding. The
reason is that the lower energy at very high frequencies requires fewer than average bits
for the additional bandwidth, resulting in the lower rate measured in bits per sample.
Work related to increased coding efficiency, efficient stereo encoding and more effective
compression/packing algorithms is underway.

5.2 Introduction To DAB

5.2.1 Why DAB System?

The use of digital transmission techniques for the terrestrial broadcasting of radio programs
was previously prevented by the large bandwidth requirements, that is, high frequency
demands. As a result of modern audio compression techniques which exploit the
psychoacoustic properties of the human ear, the present state of the art is capable of achieving
considerable reductions in the bit rate transmitted. Therefore an opportunity will arise for
transmitting a digitally encoded stereo program in an FM channel bandwidth. The main
objective of DAB (Digital Audio Broadcasting) is to achieve error free signal transmission
with better audio quality and a minimum use of bandwidth. Audio quality must be an
improvement over the basic characteristics (such as noise and distortion) of FM, whose radio
channels are highly impaired by multipath reception. Therefore DAB would be an efficient
successor of the terrestrial FM broadcasts by occupying the VHF band of 87.5 to 108
MHz without losing [9] the capacity to broadcast the existing number of FM programs.
Radio channels using digital transmission techniques will permit the operation of single
frequency networks in the future. These digital single frequency networks will permit
substantial frequency economy. Broadcasting a radio program throughout a service area
only requires one single transmission channel [2] or a part of a corresponding multiplex
signal. The larger the service area, the less frequency volume is required for broadcasting a
program. For this reason DAB will require only about one third of the frequency volume
to broadcast programs, compared with FM today. In order to reuse frequency blocks as
often as possible, and due to other physical conditions relating to digital single frequency
networks, transmitter power compared with FM can be reduced considerably. Only low
transmitter power permits the reuse of the same frequency block at distances that are not
too great. Moreover, directional antennas with levels of up to 20 dB must be used at the
edges of service areas to minimize undesired radiation patterns.

5.2.2 Merits Of The DAB System

• The DAB single frequency networks will exploit the resources of wireless transmission
with extraordinary efficiency. This applies in particular to programs which are
broadcast nationwide or for large service areas.

• DAB will also provide listeners in the long term with much greater programming
for terrestrial reception with mobile and portable radios. Full coverage broadcasting
of programs that are interference free and of excellent quality will be ensured at the start
of DAB. That is, in future, DAB will become the CD RADIO.
Appendix


% Transform Coder

N = 2048;

m = input('Enter the number of frames: ');
fid1 = fopen('m.tim', 'r');
dum = fread(fid1, 20, 'short');        % discard the first 20 samples of the file
x = fread(fid1, 2048*m, 'short');
x1 = reshape(x, N, m);                 % one frame per column

% Hanning window

for c1 = 1:256
    g(c1) = cos(2*pi*c1/257);
    h(c1) = sqrt(0.5*(1 - g(c1)));     % square root of a Hanning window
end

t = ones(1, 1792);
u = t(1, :);
w(129:1920) = u;                       % flat centre section
for c2 = 1:128
    w(c2) = h(c2);                     % rising edge
    w(1920+c2) = h(128+c2);            % falling edge
end

c = 1:2048;
for l = 1:m
    W(c) = w;
    F(1:2048) = W(c);
    K = x1(1:N, l) .* F';              % window the current frame
    ft = fft(K, N);
    reft = ft;
    recp = real(ft);
    imcp = imag(ft);
    ft(1) = ft(1) + ft(1025)*1i;       % pack dc and Nyquist into one complex point
    sp = ft(1:1024);
    ft(1025:2048) = [];
    re = real(sp);
    im = imag(sp);

    % Spectral peak calculation & Bit rate calculation

    sr = abs(re);
    si = abs(im);
    sq1 = 0.0;
    thj = 7.02;                        % initial (constant) threshold
    count = 1:8;
    for ind = 1:128
        srm = max(sr(count));
        sim = max(si(count));
        p(ind) = max(srm, sim);        % peak of each group of 8 complex points
        pdb(ind) = 20*log10(p(ind));
        pquan(ind) = 0.664*round(pdb(ind)/0.664);   % 0.664 dB quantizer step
        pval(ind) = 10^(pquan(ind)/20);
        q1(ind) = 2*round(pval(ind)/thj) + 1;       % number of levels, Eqn 4.3
        nbt(ind) = round(log2(q1(ind)));            % bits for this quantizer
        count = count + 8;
    end

    jj = 1:16;
    for n = 1:128
        rec(jj) = nbt(n);              % 16 entries per spectral peak
        jj = jj + 16;
    end

    sq1 = sum(q1);
    totb = sum(rec);
    while totb > 14320                 % too many bits: raise the threshold
        thj = thj*1.005;
        for id1 = 1:128
            q1(id1) = 2*round(pval(id1)/thj) + 1;
            nbt(id1) = round(log2(q1(id1)));
        end
        j2 = 1:16;
        for i1 = 1:128
            rec(j2) = nbt(i1);
            j2 = j2 + 16;
        end
        sq1 = sum(q1);
        totb = sum(rec);
    end

    while totb < 14320                 % bits left over: lower the threshold
        thj = thj/1.002;
        for id2 = 1:128
            q1(id2) = 2*round(pval(id2)/thj) + 1;
            nbt(id2) = round(log2(q1(id2)));
        end
        j4 = 1:16;
        for i2 = 1:128
            rec(j4) = nbt(i2);
            j4 = j4 + 16;
        end
        sq1 = sum(q1);
        totb = sum(rec);
    end

    lev = 2.^rec;
    i3 = 1:16;
    for i4 = 1:128
        nlev(i4) = max(rec(i3));       % bits for this group of coefficients
        i3 = i3 + 16;
    end

    for i6 = 1:128
        step(i6) = (2*pval(i6))/2^(nlev(i6));   % quantizer step size per group
    end

    j6 = 1:16;
    for i10 = 1:128
        ssiz(j6) = step(i10);
        j6 = j6 + 16;
    end

    retrs = recp';
    imtrs = imcp';
    for i7 = 1:2048
        requan(i7) = ssiz(i7)*round(retrs(i7)/ssiz(i7));
        imquan(i7) = ssiz(i7)*round(imtrs(i7)/ssiz(i7));
    end

    for i11 = 1:2048
        redat(i11) = requan(i11) + imquan(i11)*1i;
    end

    redatt = conj(redat');             % column vector of quantized coefficients
    s = ifft(redatt, N);
    e = real(s);
    rw = e .* F';                      % window again with the analysis window
    fid2 = fopen('tc.dat', 'w');
    count1 = fprintf(fid2, '%6.4f\n', rw);
    rwt = rw';

    % 16 bit PCM to 'b' bit PCM conversion
    b = input('Enter the number of bits: ');
    step = (2*32767)/(2^b);
    for i12 = 1:2048
        bit8(i12) = step*round(x1(i12, l)/step);
    end

    bitt = bit8';
    fid3 = fopen('coder.dat', 'w');
    count2 = fprintf(fid3, '%6.4f\n', bitt);

    %pause
    c = c + 1920;

    %sound(x1, 32000)
    %pause(5)
    %sound(rw, 32000)
    fclose('all');
end

figure(1);
subplot(1,2,1); plot(x1(1:2048, 1));
xlabel('Sample Number'); ylabel('Amplitude');
subplot(1,2,2); plot(rw(1:2048));
xlabel('Sample Number'); ylabel('Amplitude');

figure(2);
subplot(1,2,1); plot(x1(1:2048, 1));
xlabel('Sample Number'); ylabel('Amplitude');
subplot(1,2,2); plot(bitt(1:2048));
xlabel('Sample Number'); ylabel('Amplitude');

%print tcod -f figure(1) -deps2
%print cod8 -f figure(2) -deps2

% Perceptual Transform Coder

N = 2048;
fs = 32000;

m = input('Enter the number of frames: ');
fid1 = fopen('m.tim', 'r');
dum = fread(fid1, 20, 'short');        % discard the first 20 samples of the file
x = fread(fid1, 2048*m, 'short');
x1 = reshape(x, N, m);                 % one frame per column

% Hanning window

for c1 = 1:256
    g(c1) = cos(2*pi*c1/257);
    h(c1) = sqrt(0.5*(1 - g(c1)));
end

t = ones(1, 1792);
u = t(1, :);
w(129:1920) = u;
for c2 = 1:128
    w(c2) = h(c2);
    w(1920+c2) = h(128+c2);
end

c = 1:2048;

% fft & psd calculation
for l = 1:m
    W(c) = w;
    F(1:2048) = W(c);
    K = x1(1:N, l) .* F';
    ft = fft(K, N);
    ft(1) = ft(1) + ft(1025)*1i;       % pack dc and Nyquist into one complex point
    sp = ft(1:1024);
    ft(1025:2048) = [];
    freq = 16000*(0:1023)/1024;
    fqt = freq';
    re = real(sp);
    im = imag(sp);
    %pw = (re).^2 + (im).^2;
    pw = (sp .* conj(sp))/length(sp);  % power spectrum
    pwdb = 10*log10(pw);
    fid2 = fopen('pw.dat', 'w');       % file name partly illegible in the source listing
    fprintf(fid2, '%6.4f\t\t%6.4f\n', [fqt pw]');   % one frequency/power pair per line
    figure(1);
    plot(freq, pwdb);
    xlabel('Frequency in Hz'); ylabel('Power in db');

    % calculation of CB energy

    fid3 = fopen('lim.dat', 'r');      % band limits as bin indices; file name partly illegible in the source listing
    lt = fscanf(fid3, '%d');
    c3 = 1:2;
    cnt = lt(c3);

    B = zeros(1, 24);
    slp = 0.0;

    fid4 = fopen('fqlim.dat', 'r');    % band limits in Hz; file name partly illegible in the source listing
    llt = fscanf(fid4, '%5d');
    cc3 = 1:2;
    ccnt = llt(cc3);

    for c4 = 1:24
        for c5 = cnt(1):cnt(2)
            slp = slp + pw(c5);        % sum the power in critical band c4
        end
        for cc5 = ccnt(1):ccnt(2)
            bk(cc5) = slp;             % hold the band energy over the band for plotting
        end
        cc3 = cc3 + 1;
        ccnt = llt(cc3);
        B(c4) = slp;
        slp = 0.0;
        c3 = c3 + 1;
        cnt = lt(c3);
        cnt(1) = cnt(1) + 1;
    end

    bdb = 10*log10(bk);
    figure(2);
    plot(freq, pwdb, 'y', 1:15500, bdb, 'r-');
    gtext('Power Spectrum'); gtext('Bark Spectrum');
    xlabel('Frequency in Hz'); ylabel('Power in db');

    % calculation of SFM

    dav = 1/length(pw);
    gm = 0.0;
    for cc8 = 1:1024
        gm = gm + 10*log10(pw(cc8));
    end

    Gm = dav*gm;                       % Geometric mean (in dB)
    Am = 10*log10(mean(pw));           % Arithmetic mean (in dB)
    SFM = Gm - Am;
    alfa = min((SFM/(-60)), 1);        % tonality index

    fclose('all');
    c = c + 1920;
end

figure(3);
plot(1:2048, w, 'y-', 1921:3968, w, 'y-', 3841:5888, w, 'y-');
xlabel('Sample number'); ylabel('Amplitude');

%print psp -f figure(1) -deps2
%print bsp -f figure(2) -deps2
%print win -f figure(3) -deps2


% SNR Calculation

fid1 = fopen('m.tim', 'r');
dum = fread(fid1, 20, 'short');
x = fread(fid1, 8000, 'short');
fid2 = fopen('tc.dat', 'r');
tout = fscanf(fid2, '%f');             % transform coder output
fid3 = fopen('coder.dat', 'r');
codout = fscanf(fid3, '%f');           % 'b' bit PCM coder output
ltc = length(tout);
mps = x(200:ltc).^2;
avm = (sum(mps))/(ltc-199);            % mean signal power over samples 200..ltc
qetc = x(200:ltc) - tout(200:ltc);     % transform coder quantization error
sqtc = qetc.^2;
avtc = (sum(sqtc))/(ltc-199);
snrval = avm/avtc;
snrtc = 10*log10(snrval);
qecod = x(200:ltc) - codout(200:ltc);  % PCM coder quantization error
sqcod = qecod.^2;
avcod = (sum(sqcod))/(ltc-199);
snrcod = avm/avcod;
snr8 = 10*log10(snrcod);